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Abstract 

In this paper, optimal linear transceiver designs for multi-hop amplify-and-forward (AF) Multiple- 
input Multiple-out (MIMO) relaying systems with Gaussian distributed channel estimation errors are 
investigated. Some commonly used transceiver design criteria such as weighted mean-square-error (MSE) 
minimization, mutual information maximization, worst MSE/MAX-MSE minimization and weighted 
sum-rate maximization, are taken into a single matrix-variate optimization problem. Exploiting ma- 
jorization theory and properties of matrix-variate functions, the optimal structures of robust transceivers 
are derived. Based on the optimal structures, the original transceiver design problems are reduced to 
much simpler problems with only scalar variables whose solutions are readily obtained by iterative water- 
filling algorithms. A number of existing transceiver design algorithms are found to be special cases of 
the proposed solution. The performance advantages of the proposed robust designs are demonstrated by 
the simulation results. 
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I. Introduction 

In order to satisfy the emerging requirements for high speed ubiquitous wireless communica- 
tions, cooperative communication has become one of the key parts in the future wireless standards 
such as LTE, IMT- Advanced, Winner project, etc. Specifically, relays are deployed to enhance the 
coverage of base station and to improve the quality of wireless links fl]. In general, relays can 
adopt different relaying strategies, e.g., amplify-and-forward (AF), decode-and-forward (DF) and 
compressed-and-forward (CF). Among these relaying strategies, AF scheme is the most attractive 
for practical implementation due to it low complexity and being independent of the underlying 
modulation. On other hand, it is well-established that employing multiple antennas provides 
spatial diversity and multiplexing gain in a wireless communication system. Consequently, to 
obtain the virtues of these two techniques, combination of AF transmission and multi-input 
multi-output (MIMO) systems (termed AF MIMO relaying) attracts more and more researchers' 
interest 0. 

Transceiver design for AF MIMO relaying systems has been reported in [|3l- [fT0l . There are 
various design criteria with different goals. The most common criteria are capacity maximization 
0, flU, (HI and data mean-square-error (MSE) minimization [EJ — [JH1 - In most of the previous 
works on transceiver design, channel state information (CSI) is assumed to be perfectly known. 
Unfortunately, channel estimation errors are inevitable in practical systems. To mitigate the effect 
on the performance of AF relaying systems, such channel estimation errors should be taken into 
account in the transceiver design process. 

In general, channel estimation errors can be modeled in two different ways: norm-bounded 
errors with known error bound and random errors with certain distribution. Correspondingly, 
robust transceiver designs can also be classified into two main categories: worst-case robust 
design for the norm-bounded errors and Bayesian robust design for the randomly distributed 
errors. For linear channel estimators, the estimation errors can be shown to be random with 
Gaussian distribution ifTTI . Recently, Bayesian robust transceiver design minimizing weighted 
MSE for dual-hop AF relaying systems under Gaussian channel estimation errors have been 
reported in |HH, [[121, |f!4l. 

In this paper, we consider the more general multi-hop AF relaying and derive the optimal 
transceiver structure in all nodes under channel estimation errors. Taking the Gaussian distributed 
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channel errors into account, the precoder at source, multiple forwarding matrices at the relays 
and equalizer at destination are jointly designed. Furthermore, in this work the transceiver 
design problems for several widely used design objectives (such as weighted MSE minimization, 
capacity maximization, worst case MSE minimization, and weighted sum-rate maximization) are 
formulated into a general optimization problem. Then the structure of the optimal solution for 
the general problem is derived based on majorization theory and properties of matrix-monotone 
functions. In general, the derived optimal structure is significantly different from its counterpart 
with perfect CSI. With the optimal structure, iterative water-filling solutions are proposed to 
obtain the remaining unknown parameters in the transceiver. Finally, the performance of the 
proposed robust designs are assessed by simulation results. 

The rest of the paper is organized as follows. In Section UH the signal model is introduced. 
Then the robust transceiver design problem is formulated in Section HHl covering weighted MSE 
minimization, mutual information maximization, MAX-MSE minimization and weighted sum- 
rate maximization. In Section [TV] the optimal structure of the robust transceiver is derived and 
the transceiver design problem is reduced to a problem of finding a set of diagonal matrices, 
which can be solved by iterative water- filling. The performance of the proposed robust designs is 
demonstrated by simulation results in Section |V] Finally, this paper is concluded in Section |VIl 

The following notations are used throughout this paper. Boldface lowercase letters denote 
vectors, while boldface uppercase letters denote matrices. The notation Z H denotes the Hermitian 
of the matrix Z, and Tr(Z) is the trace of the matrix Z. The symbol Im denotes the M x M 
identity matrix, while Om,n denotes the Mx N all zero matrix. The notation Z 1 / 2 is the Hermitian 
square root of the positive semidefinite matrix Z, such that Z 1 / 2 Z 1 / 2 = Z and Z 1 / 2 is also a 
Hermitian matrix. The symbol A»(Z) represents the i th largest eigenvalue of Z. The symbol ® 
denotes the Kronecker product. For two Hermitian matrices, C >z D means that C — D is a 
positive semi-definite matrix. The symbol A \ represents a rectangular diagonal matrix with 
decreasing diagonal elements. 

II. System Model 

In this paper, a multi-hop AF MIMO relaying system is considered. As shown in Fig. [Q one 
source with Ni antennas wants to communicate with the destination with M k antennas through 
K — 1 relays. For the k th relay, it has M k receive antennas and iV fc+1 transmit antennas. It is 
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obvious that the dual-hop AF MIMO relaying systems is one of its special cases when K = 2. 

At the source, a N x 1 data vector s with covariance matrix R s = E{ss H } = 1 N is transmitted 
through a precoder matrix Pi. The received signal xi at the first relay is xi = HxPxS + rix where 
Hx is the MIMO channel matrix between the source and the first relay, and nx is the additive 
Gaussian noise vector at the first relay with zero mean and covariance matrix R ni = crfI Ml . 

At the first relay, the received signal x x is first multiplied by a forwarding matrix P 2 and then 
the resultant signal is transmitted to the second relay. The received signal x 2 at the second relay 
is x 2 = H 2 P 2 xx + n 2 , where H 2 is the MIMO channel matrix between the first relay and the 
second relay, and n 2 is the additive Gaussian noise vector at the second relay with zero mean 
and covariance matrix R n2 = ofl^. Similarly, the received signal at k th relay can be written 
as 

x fc = HfcP fc x fc _x + n k (1) 

where is the channel for the k th hop, and is the additive Gaussian noise with zero mean 
and covariance matrix R nfc = <J$-M k - 

Finally, for a fT-hop AF MIMO relaying system, the received signal at the destination is 

K K-1 K 

y = [[]H t P fc ]s + II H < P 'M + n K, (2) 

k=l k=l l=k+l 

where n^Li^fe denotes x ■•• x In order to guarantee the transmitted data s can be 
recovered at the destination, it is assumed that N k and M k are greater than or equal to N 0. 

In practical systems, because of limited length of training sequences, channel estimation errors 
are inevitable. With channel estimation errors, we can write 

H fc = H fc + AH fc , (3) 

where is the estimated channel in the k th hop and AH k is the corresponding channel esti- 
mation error whose elements are zero mean Gaussian random variables. Moreover, the M k x Nk 

1 /2 1 /2 

matrix AH fc can be decomposed using the widely used Kronecker model AH fc = S fc ' H Wtk ^f k ' 
[fTT | - lfT3l . fTTSll . |[T6ll . The elements of the M k x N k matrix Hw,k arQ independent and identically 
distributed (i.i.d.) Gaussian random variables with zero mean and unit variance. The specific 
formulas of the row correlation matrix S fe and the column correlation matrix are determined 
by the training sequences and channel estimators being used [[TT|. lfT3l . 
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At the destination, a linear equalizer G is employed to detect the desired data vector s. The 
resulting data MSE matrix equals to 3>(G) = E{(Gy — s)(Gy — s) H }, where the expectation 
is taken with respect to random data, channel estimation errors, and noise. Following a similar 
derivation in dual-hop systems lfT2ll . the MSE matrix is derived to be 

*(G) = E{(Gy - s)(Gy - s) H } = Cffi^R^ + Tr(P^R XA ._ 1 P|* i ,)S x 

K K 

+ R nA .]G H + l N - [J] H fc P fc ] H G H - G fQ HfcPfc], (4) 

k=l k=l 

where the received signal covariance matrix R Xfc at the k th relay satisfies the following recursive 
formula 

R Xfc = H fc P fc R Xfc ^fif + Tr^R^Pf * fc )E fc + R. fe , (5) 
and R Xo = R s = 1^ represents the signal covariance matrix at the source. 

III. Transceiver Design Problems 

A. Objective Functions 

There are various performance metrics for transceiver designs. In the following, we focus on 
four widely used metrics. 

(1) First we consider the weighted MSE which can be written as 

Obj 1: Tr[W#(G)] (6) 

where the weighting matrix W is a positive semi-definite matrix IfTTl . Note that W is not 
restricted to be a diagonal matrix. Weighted MSE aims at minimizing the distortion between the 
recovered and the transmitted signal 0, lfT8l . |[P9ll . 

(2) Maximizing capacity is another most important and widely used performance metric for 
transceiver design. Denoting the received pilot for the channel estimation as r, the channel 
capacity between the source and destination is I(s;y|r) ll2~0ll . Unfortunately to the best of our 
knowledge, the exact capacity of MIMO channels with channel estimation errors is still open 
even for point-to-point MIMO systems |fT6l , Il20l . To proceed, a lower bound of capacity is 
usually exploited 0U|, [HI 

-log|#(G)| <I(s;y|r), (7) 
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where — log|$(G)| can be interpreted as the sum-rate of multiple transmitted data streams when 
linear equalizer G is employed. Notice that if perfect CSI is known, the inequality above becomes 
equality flU, |fT8l . For imperfect CSI, the tightness of this bound is extensively investigated 
in Il20l , Il22l . Based on this lower bound, the robust transceiver design maximizing mutual 
information can be replaced by minimizing the following objective function lfT6l . Il22l 

Obj 2: log|*(G)|. (8) 

(3) Notice that Obj 2 does not impose any fairness on the simultaneously transmitted multiple 
data streams, while Obj 1 imposes limited degree of fairness on the data as weighted MSE is 
only a linear operation. In general, for balancing the performance across different data streams, 
(e.g., minimizing the worst data stream MSE), the objective function is written as |[T8l 

Obj 3: Vi[d($(G))] (9) 

where if>i(*) is an increasing Schur-convex function and d(<fr(G)) = [[<&(G)]u [<&(G)] 2 ,2 • • - } T , 
with the symbol [Z]jj represents the (i,j) th entry of Z. 

(4) On the other hand, if a preference is given over a certain data streams, (e.g., loading more 
resources to the data streams with better channel state information), the objective function can 
be written as 

Obj 4: V 2 [d(*(G))] (10) 
where ij) 2 (*) ls an increasing Schur-concave function. 



B. Problem Formulation 

Although the above four criteria aim at different designs, the transceiver design optimization 
problem can be formulated into a single form: 

min /(*(G)) 

s.t. Tr(P fc R otfc _ 1 Pf) < P k , k = l,---,K (11) 

where the objective function /(•) is a real-valued matrix-variate function with <fr(G) as its 
argument. Notice that for all the four objectives described above, /(•) is a matrix-monotone 
increasing function (|24]|. 
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For (fTTI) . there is no constraint on the equalizer G. We can differentiate the trace of © with 
respect to G and obtain the LMMSE equalizer 

K 

Glmmse = fQ HfcPfcHHxP^R^PlHl + Tr(P i ,R XA ._ 1 P|* i ,)S K + R n J-\ (12) 

k=l 

with the property I12TTI 

*(G LM mse) 1 *(G). (13) 

Because /(•) is matrix-monotone increasing function, (fl"3T) implies that Glmmse minimizes the 
objective function in (ITTb . Substituting the optimal equalizer of (fl"2l) into 3?(G) in ©, <fr(G) 
equals to 



K 

$mse = Ijv - [nH fc P fc ] H [H i ,P x R XA ^PlHl 



fe=i 



+ ^(PkR^^PI^kJSk + RnJ^II HfcPfc ]' (14) 

fc=l 

For multi-hop AF MIMO relaying systems, the received signal at k th relay depends on the 
forwarding matrices at all preceding relays, making the power allocations at different relays 
couples with each other (as seen in the constrains of (fTTI)). and thus the problem (fTTI) difficult 
to solve. In order to simplify the problem, we define the following new variable in terms of P&: 

F k 4 P.K^ 2 1 (K-;/ 2 fl fe _ 1 F fc _ 1 F«_ 1 H«_ 1 K p ^ + I^J^QSU, (15) 

where K Ffe = Tr(F fe F^ \I/ fc )£ fc + <T 2 fc lM fe and is an unknown unitary matrix. The introduction 
of Qk is due to that fact that for a positive semi-definite matrix M, its square roots has the form 
M 1 / 2 Q where Q is an unitary matrix. Notice that F x = P^ With the new variable, the MSE 
matrix <E»mse is reformulated as 

K K 

<&mse = In ~ [J! Q fc n- 1/2 K Ffc 1/2 H fc F fc ] H [Il Q fc n- 1/2 K F ; /2 H fc F fc ] 
fe=i fe=i v v ' 

= I N -A*---A*A K ---A 1 . (16) 

Meanwhile, with the new variables F k , the corresponding power constraint in the k th hop can 
now be rewritten as 



Tr(F fc Ff ) < P k . (17) 
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It is obvious that with the new variables F k , the constraints become independent of each other. 
Putting (fT6l) and (flTT) into (flTT) . the transceiver design problem can be reformulated as 

P 1: min f(I N - 0) 

s.t. Tr(F fc Ff ) < P k , k = l,---,K 

& = Af---Af < A K ---A u Q*Q fc = I Mfe (18) 

From the definition of A k in (03 and noticing that K Ffc = Tr(F k F^^ k )'S k + o 2 n \ Mk , it 
can be seen that F k appears at multiple positions in the objective function. Therefore, the 
optimization problem is much more complicated than the counterpart with prefect CSI. Indeed, as 
demonstrated by existing works, robust transceiver design for point-to-point or dual-hop relaying 
MIMO systems is much more complicated and challenging than its counterpart with perfect CSI 

ed, m, 0, ma. 

IV. Optimal Structure of Robust Transceiver 

Based on the formulations of the objectives given in ©, ®, © and (flOl) . in Appendix [A] it 
is proved that P 1 has the following property. 

Property 1: At the optimal value of P 1, © must have the structure of 

= U n diag[A(0)]U* (19) 

where the vector A(0) = [A^©), • • ■ , Atv(0)] t with Aj(0) being the i th largest eigenvalue of 
0, and 

' U w for Obj 1 
U Arb for Obj 2 



U 



f2 



(20) 

Q F for Obj 3 



I at for Obj 4 

In (|20l) . the unitary matrix Uw is defined from the eigen-decomposition W = UwA\yU w 
with A w \, the matrix UArb is an arbitrary unitary matrix, and Q F is the unitary matrix which 
makes Q F diag[A(0)]Qp having identical diagonal elements. Furthermore, with this optimal 
structure, the objective function of P 1 equals to 

f(I N - U n diag[A(0)]U£) = flf[A(0)] (21) 

V v ' 

e 
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where g(m) is a monotobically decreasing and Schur-concave function with respective to A(0)Q 
Proof: See Appendix [A] ■ 

Based on Property 1, the objective function of <fl"8T ) can be directly replaced by gr[A(0)] and 
thus the optimization problem is simplified as 

P 2: min flf[A(0)] 

s.t. Q = A*--.A*A K .--A 1 , = U n diag[A(0)]U« 

Tr(F fc F*)<P fc , QfQ fc = I Mfc (22) 

where A^'s are defined in (fTBl) . In order to further simplify the optimization problem, we make 
use of the following two additional properties. 

Property 2: As «/(•) is a decreasing and Schur-concave function, the objective function in P 2 
satisfies 

3(A(0)) > g(hi(&) ■■■i N (e)] T ) (23) 

V v ' 

=7(0) 

with 7j (0) 4 A i (A|A^)A i (A^_ 1 A lc _ 1 ) • • • A,(A? A^, (24) 

with the equality in (l23l holds when the neighboring A^'s satisfy 

Va j =U Vii fc = 2, - - ■ , if (25) 

where unitary matrices Uyi t and are defined based on the following singular value decom- 
position A k = \J Ak A Ak V^ k with A Ak \. 
Proof: See Appendix El ■ 

Property 3: As gr(«) is a monotonically decreasing function with respective to its vector argu- 
ment, the optimal solutions of the optimization problem always occur on the boundary: 

Tr(F fc F*) = P k . (26) 

Furthermore, defining 

Vfk 4 Tr(F fe Ff * k )a k + < (27) 

'The specific expressions of <?(•) are given in Appendix lAl but they are not important for the derivation of the optimal 
structures. 
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with a k = Tr(S fc )/M fc which is a constant, ( |26l) is equivalent to 

Tr[F fe Ff (a k P k * k + a 2 n I Nk )}/ Vfk = P k . (28) 

Proof: See Appendix O ■ 

Based on Properties 2 and 3 , the optimal solution of the optimization problem (1221) is exactly 
the optimal solution of the following new optimization problem with different constraints 

P 3: min g[j(®)] 

s.t. Tr[F fe F* (a k P k * k + a 2 n l Nk )]/ Vfk = P k 
Q = A*--.A*A K .--A 1 
Q^Q k = I Mk , e = Undiag[ 7 (0)]U£ 
V^ fc = U Afc _ i; k = 2,---,K. (29) 

Noticing that g(m) is a monotonically decreasing function, solving P 3 gives the following 
structure for the optimal solution. 

Conclusion 1: Defining unitary matrices U^ fc and V^ fc based on the following singular value 
decomposition 

with A Hk \ and U<h = U n , (30) 

when SE^. oc I or S fc oc I, the optimal solutions of the optimization problem (1291 have the 
following structure 



F fc , pt = ^fc(A^)(a fc P fc * fc + <I^r 1/2 V<H fc ,7vA^U£ fc _ ii7V and Q fc , opt = l Mk , (31) 

where V-^ fci jv and XJ-n k ,N are the matrices consisting of the first A columns of V^ fc and U-^ fc , 
respectively, and A^ fc is a A x A unknown diagonal matrix. The scalar ^(Ajr fe ) is a function 
of A^.and equals to 

Gfc(A^) = </{l - a fe Tr[V» fcj7V (a fe P fc * fc + ^IiO" 1 ^**^** + <k)" 1/S V^Ay} 

= (32) 
Proof: See Appendix [D] ■ 
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In the optimal structure given by (|3T| ), the scalar variable £ k (Aj: k ) is only a function of the 
matrix A^ k and therefore the only unknown variable in (l3TT) is A^ fc . The remaining unknown 
diagonal elements of A^ fc can be obtained by water-filling alike solution as discussed in the 
next section. 
Special cases 

Several existing transceiver designs can be obtained as special cases of our proposed solution 
given by Conclusion 1. 

• When K = 2, S 2 oc Im 2 , and P 1 — In, the proposed solution for weighted MSE minimization 
reduces to the robust design for dual-hop AF MIMO relaying system without source precoder 

in ma. 

• When K = 2, ^ 2 oc Tv 2 , and Pi = I N , the proposed solution for weighted MSE minimization 
reduces to that for a dual-hop system in [fT2l|. 

• When K = 2, CSI being perfectly known, W = Ijy and Pi = Ijv, the proposed solution for 
robust design of weighted MSE minimization reduces to that in Q. 

• When K = 2, CSI being perfectly known and W = Ijv, the proposed solutions with Schur- 
convex/concave objective functions reduces to the that with perfect CSI given in (8). 

• When K — 2, CSI being perfectly known and P x = I N , the proposed solution for the robust 
design maximizing mutual information reduces to that in |@]|. 

• When K — 1, the proposed solution for weighted MSE minimization reduces to the point-to- 
point robust transceiver design given in |[T3l . 

• When K — 1, the proposed solution for maximizing mutual information reduces to that for 
dual-hop systems given in lfT6ll . 

Remark 1: For the general case when I Nk and S fc 9c Ijvf fc , to the best of our knowledge, 

the closed-form solution of the robust design is still open, even for point-to-point MIMO systems. 
The main difficulty comes from that fact that when SE^. 9c I Nk and S fe gt I Wfe , K. Fk /i]f k is a 
function of F k instead of a constant. However, when ty k 9c 1^ and S fc gk l Mk , K Ffc /?7/ fe in d30l) 
can be replaced by its upper bound 

K F J Vfk 1 P^^/iP^kW + <)E* + <./(P fc Ai(* fc )« fe + <)lM fe , (33) 

such that it is not a function of F^. Notice that the above inequality becomes equality when 

oc Ijv fc or S fc oc I Mfc . 
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The remaining unknown variables in (13TT) are only &j= k . Substituting the optimal structures 
given by Conclusion 1 into P 3 and defining [A-^J^ = h kti and [A^J^ = f kii for i — 1, • • • , N, 
the optimization problem for computing Ap k becomes 

min g[j(@)] 

s .t. 7 (0) = r 7l (0) ... 7 ^(©)] T , 7< (e) = ^=iflKi 

N 
i=l 

The specific expressions of <?(•), which have been derived in (1331) , (1361) , (1381 ) and (1391) of Ap- 
pendix [A] for the four different objectives. For all objectives, a widely used and computationally 
affordable solution of ft,i is the iterative water- filling [|23"1 . 

V. Simulation Results and Discussions 

In this section, the performance of the proposed robust designs are evaluated by simulations. 
For the purpose of comparison, the algorithms based on the estimated channel only (without 
taking the channel estimation errors into account) are also simulated. In the following, we 
consider the AF MEMO relaying systems where all nodes are equipped with 4 antennas. Further- 
more, the estimation error correlation matrices are chosen as the popular exponential model, i.e., 
[*fe]ij = o\ot\ % ~i\ and [£/ckj = where a and (3 are the correlation coefficients, and 

denotes the estimation error variance lfT2ll . lfl"5l . The estimated channels H fc 's, are generated based 
on the following complex Gaussian distributions H fe ~ CAfM k ,N k (QM k ,N k , (1 — ^D/^l^k ® ^1) 
[fT2|. such that channel realizations = + AH^ have unit variance. We define the signal- 
to-noise ratio (SNR) for the k th link as P k /cr^ . Each point in the following figures is an average 
of 10 4 trials. 

First, when K = 2, the proposed algorithms will reduce to the robust design for two-hop AF 
MIMO relaying systems. Fig. [2] shows the sum-rates of a two-hop AF MEMO relaying system 
for different algorithms when a = 0.6 and (3 = 0. It can be seen that two robust algorithms have 
better performance than the algorithm based on estimated CSI only. Furthermore, the performance 
of the proposed robust design is better than that the algorithm proposed by Rong in lfT4l and 
our algorithm has a lower complexity. In the following, we will focus on a three-hop AF MIMO 
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relaying system, i.e., K = 3. For MSE and bit error rate (BER), at the source node, four 
independent data streams are transmitted and in each data stream, N Data = 10 4 independent 
QPSK symbols are transmitted. 

With weighting matrix being arbitrarily selected as W = diag{[0.5 0.4 0.1 0.1]}, Fig. [3] 
shows the weighted MSE at the destination for the proposed algorithm and the algorithm based 
on estimated CSI only, when a = 0.6, (3 = 0, and P^ja\ k = 30dB. It can be seen that the 
performance of the proposed algorithm is always better than that based on the estimated CSI 
only. When a 2 e = 0, the performance of the two algorithms is the same as expected. 

Fig. H] shows the sum-rates resulting from the proposed algorithm and the algorithm based 
on estimated CSI only with different SNRs (SNR = P k /a* k ). It is assumed that the SNRs 
at various hops are the same. The correlation coefficients in the channel estimation errors are 
taken as a = 0.6 and j3 — 0. It can be seen that the performance of the proposed algorithm is 
better than that of the corresponding algorithm based on estimated CSI only. Furthermore, as 
the estimation errors increase, the performance gap between the two algorithms becomes larger. 
This result coincides with corresponding result in the MSE-based robust design, as shown in 
Fig.lH 

Fig. [5] shows the maximum MSE across four data streams with a = and (3 = 0.6. It can be 
seen that as SNR increases, the performance gain of the proposed robust design over the design 
with estimated CSI only becomes larger. Similarly, the performance gap is more apparent when 
cjg increases. 

Fig. [6] shows the BERs of the proposed robust designs with different performance metrics: 
sum MSE minimization (i.e., weighted MSE minimization with W = I), mutual information 
maximization and MAX-MSE minimization. Other parameters are taken as a = 0.6, (3 = and 
o"g = 0.004. It can be seen that the former two criteria have better performance than the latter 
one. Meanwhile, the capacity maximization based on estimated CSI only is given to show that 
the proposed robust designs are better than that of the design with estimated CSI only. 

VI. Conclusions 

Bayesian robust joint transceiver design for multi-hop AF MIMO relaying systems under 
channel estimation errors was considered. Various transceiver design criteria such as weighted 
MSE minimization, mutual information maximization, worst MSE minimization and weighted 
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sum-rate maximization were formulated into a single optimization problem. Using majorization 
theory and properties of matrix- variate functions, the optimal structure of transceivers was de- 
rived. Then, the transceiver design problems were greatly simplified and the remaining unknowns 
were obtained by iterative water-filling solutions. The performance of the proposed transceiver 
designs has been demonstrated by the simulation results. 

Appendix A 
Proof of Property 1 

Since the LMMSE equalizer G in (fl2)) minimizes the four objective functions, the Q*(G) 
in the four objectives should be replaced by 3>mse m (Q . Below we will derive the results 
separately for the four objectives. 

Obj 1: Putting <&mse into <&(G) in objective 1, and noticing that <&mse = I at — ©> me first 
objective function © equals to 

JV 

Tr[W*(G)] = Tr(W) - Tr(W0) > Tr(W) - ^ A;(W)A;(0) (35) 

i=l 

S v ' 

%(A(0)) 

where the inequality in the second line follows from the fact that for two positive semi-definite 
matrices W and ©, TV(W0) < £\ Aj(W)Aj(0) with \{Z) denoting the i th largest eigenvalue 
of Z. Furthermore, to make the equality in the second line of (1351) hold, © should have the 
following structure [24] = U w diag(A(0))U^ where A(0) = [Ai(0),--- ,Aat(0)] t and 
Uw is the unitary matrix containing the eigenvectors of W as columns. 

Based on the definition of Schur-convex function ||24||. it can be concluded that g(X(®)) 
is a Schur-concave function with respective to A(0). Furthermore, for two vectors v < u, 
#( v ) > d( u )- I n other words, g(») is a decreasing function. In this paper, for two vectors v > u 
represents Vi > ui. 

Obj 2: For the second objective function given by ®, it is directly a function of A(0) 

N 

\og\I N - 0| = £>g[l - Ai(0)] . (36) 

V v ' 

Obviously, the above the equality holds unconditionally. Therefore, we can write the structure of 
as = UArbdiag(A(0))UA rb where UArb is an arbitrary unitary matrix. Following a similar 
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derivation to that in the first objective, it can also be proved that gr(A(0)) is a decreasing 
Schur-concave function with respective to A(0). 

Obj 3: For the diagonal elements of the positive semi-definite matrix <&mse = Itv — 0, we have 
the following relationship ||24| 

d(Ijv -&)yl N - (Y] N Ai(0)/JV) (g> 1 N (37) 

with the equality holds if and only if [0]j,j = ^2f = i^i(®)/N, where Itv is the iV x 1 all one 
vector. 

For the third objective function in ©, as is decreasing and Schur-convex, the objective 

function satisfies |[T8l 

^i(d(Ijv - 0)) > 1>i (liv - ( X^AiC©)/^) g) ljv) , (38) 

with equality holds if and only if [0]t,j = Sili-M®)/^- Therefore, must have the following 
structure IfTSl = Q F diag(A(0))Qp where Qf is a unitary matrix such that has identical 
diagonal elements. 

Based on the definition that ls a decreasing and Schur-convex function, it can be directly 

proved that gr(A(0)) is a decreasing and Schur-concave function of A(0) E4l . 
Obj 4: Notice that for the positive semi-definite matrix <&mse = Itv — 0, d(Iyv — 0) -< A(Ijv— 0) 
IfTSl . Furthermore as i^ 2 (*) ls Schur-concave, we have 

if> 2 (d{I N - 0)) > i{> 2 ([l N - A(0)]), (39) 

S v ' 

=fl[A(©)] 

In order to make the equality in (|39l hold, we need [Q]^ = A.j(0), which means that is a 
diagonal matrix. Therefore, we can write = lArdiag(A(0))lAr. 

Since ip 2 (*) i s increasing and Schur-concave, it is obvious that ip 2 (l N — A(0)) is decreasing 
and Schur-concave with respective to A(0) [|24l . 

Appendix B 
Proof of Property 2 

First notice that for two matrices A and B with compatible dimension, \{AB) = \{BA) 
[|24l 9.A.l.a]. Together with the fact that for two positive semi-definite matrices A and B, 
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nLi^(AB) < rj* =1 \ i (A)X i (B) [24, 9.H.l.a], we haveE 

k k 

H \i(A* • • • • • • A 1 ) < H \(A^ ■ ■ ■ A\A K ■ ■ ■ A 2 )X i (A 1 Af ) fc = 1, • • • , N. (40) 

i=l t=l 

Repeating this process we have the following inequality 

k k 

II X *( A i ■ ■ ■ A kAk ■■■A 1 )< H A t (A|A g )\(A|„ 1 A J ,_ 1 ) ■ ■ ■ A^AfAQ . (41) 

i=l i=l S "~ v ' 

=7i(®) 

Based on (SB and 5.A.2.b in [|24]|, we directly have 

A(Af • • • A\A K • • • AO -< w [ 7l (0) 72(e) • • • 7 7v(e)] T 4 7 (0) (42) 

with equality holds if and only if the neighboring A^'s satisfy 

VA fc =U Afe _ i; k = 2,---,K (43) 

where Ua fc and Va*. are defined based on the following singular value decomposition A k = 
UA fc A^ fc V^ fc with A Ak \. As gr(«) is a decreasing and Schur-concave function, we have 
Sf(A(0)) > g(7(0)) d2H with equality holds if and only if (gS) holds. 

Appendix C 
Proof of Property 3 

Based on the definition of 7 i(0) in (l4TT) . 7 *(0) equals to 

K 

7i(©) = n^( F " fi " K F;H fc F fc )/[l + Aj(Ff fif Kp^H fc F fc )], (44) 

fc=i 

which is an increasing function of Aj(F^H^Kp^HfcFfc). Suppose at the moment that for the 
optimal Fjt, denoted by F fci0pt , transmission is not at maximum power, i.e., Tr(F fci0pt F^ opt ) < P k , 



then we have a = y -Pfe/Tr(F fci0pt Fj opt ) > 1. Defining F k = a¥ k>opt , it follows that 

Ff H»K-;H fc F fc =Fj opt H»(Tr(F fc , opt Fj opt * fc )S fc + a 2 J aHy 1 ^^ 

^F« opt Hf (Tr(F fc , opt Ff i0pt * fc )S fc + a^I)" 1 H fc F fc , opt . (45) 



K 



-1 

F 



fc,opt 



2 Note that in general is not a square matrix. 
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Note that A y B means that Aj(A) > \i(B) for all i, and therefore (|45l) implies 

A((ff HjiqjH^) > Aj (Ff opt Hf Kp^ opt H fc F fc)0pt ) . (46) 

Moreover, as 7i(0) is an increasing function of Aj(F^H^Kp^HjtFjt) as shown in (l44l) . we have 
7 (®lF fc =F fc )) — 7 (® lF fe =F fc opt ) • Together with the fact that g(») is a decreasing function, it is 
concluded that g [7 (@lF fe =F fe )] — 9 [t (® lF fe =F fc opt )] • It is obvious that the result contradicts 
with the optimality of F^ opt , and therefore a necessary condition for the optimal Ffe, Tr(FfcF^) = 
Pfe. Furthermore, it is obvious that Tr(FfcF^) = P& if and only if 

Tr[F k Ff (a k P k ^r k + a 2 n I)} = a k P k Tr(F k F* ¥ fc ) + <P fc . (47) 

Defining % = a fc Tr(F fc Fj«* fc )+< with «fc = Tr(£ fe )/M fc , (gU) can be rewritten as Tr[F k F k 1 (a k P k V k + 
cr^ I)] = P k rjf k - In other words, 

Tr[F fe F«KP fe * fe + = P fc . (48) 

Appendix D 
Proof of Conclusion 1 

Problem reformulation; As shown previously in (|44|) . 7,(0) is a function of A(F^H^Kp^H fc F fc ). 
Unfortunately, F k appears in multiple positions. In particular, K Ffe is a function of F fc which com- 
plicates the derivation of optimal solutions. In order to simplify the problem, A(F^H^Kp^H fc F fc ) 
is reformulated as 

A(F^Hf K F ;fi fc F fc ) =\[Ff(a k P k * k + <I/v fc )" 1/2 H^(K Ffc /%)^ /2 

x (K^a)"^^ + <IivJ- 1/2 P fc ], (49) 

V v ' 

where F^ is defined as 

Ffc = 1/v/^KPA + <Ii0 1/2 F fc . (50) 

The right hand side of ( |49l) is easier to handle than the left hand side. This is because when 
Si? k oc Ijv fc or Sfc oc Ijvf fc , 'H/,. is independent of Ffc. In the following, we will prove this in detail. 
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It is obvious that H k being independent of F fc is equivalent to K Ffc /?7j fc being independent 
of F fe . First consider ^ k oc I Nk , i.e., \I> fc = /3 k l Nk . With the definitions of K Ffc in ([151) and rjf k , 
K F k /Vf k equals to 

K F Jv fk = [/3 fc Tr(F fc F*)E fc + <I M J/[^a fc TV(F fe F») + <] 

= (&P fc £ fc + <I Mfe )/(a fc /3 fc P fc + <), (51) 

where the second equality is based on the fact that Tr(FfcF^) = P k . On the other hand, when 
S fc oc l Mk (i.e, S fc = a k l Mk ), K Fk /r] fk equals to 

= [a fe Tr(F fe F^ fc )Iiv fc + <I;v fc ]/K[Tr(F fe F** fc ) + = 1^. (52) 

Therefore, when & k oc 1^ or S fc oc 1^, K. Fk /r]f k is independent of F fe . 
Using the substitution (|50l) , the optimization problem (|29l) is reformulated as 

min flf[7(0)] 

it 

s.t. Tr(F fc F«) = P fc , Q£Q* = I^, 7 i(e) = n A *(F^^ fc F fc )/[l + A i (E?'W^ fc F fc )] 

fc=i 

= Af ■ ■ ■ A\A K • • ■ A 1; A k = Q k CH k F k ffn^ + I Mk )~ 1/2 n k F k 

= Uadiag[ 7 (e)]UH, U Ak = V^, k = 2,---,K, (53) 

where and are defined based on the singular value decomposition A k = U A k ^A k ^A k 
with A Ak \. 

Structure of optimal F k : For the optimal F k , denoted as I\ opt , based on the following singular 
value decompositions 

UkFkwt = U Mk A M( VM t with A Mk \ and U k = U-^A^V^ with A Hk \, 

(54) 

we can construct a matrix F^ 

F fc = V Hk A Xk V\ (55) 

where A Xfc is an unknown diagonal matrix with the same rank as A Mfc and A^ fc Ax fe /& = A Mfe , 
and the scalar b is chosen to make Tr(F fe F^) = P k hold. 
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Because H k is independent of unknown variable, using Lemma 12 in IfLSlI , we have 

Ffc'Hfc'HfcF fc y F k i opt 'H k 'H k F k>op t. (56) 

Taking eigenvalues of both sides, we have \ i (F%'Hf'H k F k ) > A* (Ffc,opt^3^HjbFfc,apt) flU- 
Since ji(G) is an increasing function of \ i (F k i 'H k 'H k F k ), we directly have that 7(®|f = p ) — 
7(@| Ffc=Ffc t ). Furthermore, as the objective function g(m) of (1531 ) is a decreasing function, 
we finally have g[j(@\f k= p k )] < 9h(®\t k =t k . opt )]- Because f\ opt is the optimal solution, 
Ffc, op t must be in the form of F k . Therefore, the structure of optimal is given by (1551) 
F fc)0 pt = V nk Ax k VM k . 

Notice that V Mfc is still unknown. Substituting F fc opt into the definition of A k in (|53l) , we 
have V Mfc = Q fc _ 1 U^ fc _ 1 . It can be seen that the value of Q k does not affect the value of 
in optimization problem (l53l) . Without loss of generality, it is set as Qfc opt = ±M fe - As a result, 
V Mt = \]<n k l and V Ml = Uo- For the considered system model, Rank(A Xfe ) < N, and 
thus only the N x N principal submatrix of A Xfc can be nonzero, which is denoted as A^ fc . In 
summary, F fc opt has the following structure 

F fc ,opt = V^^A^U^,^ _ uN (57) 

with \] Uo = U n . 

Structure of optimal F k : Based the relationship between F k and F fc given in (l50l) . 

F fc , opt = ^qf k {a k P k ^ k + a^I^J- 1/2 F fe!0pt . (58) 

Putting the structure of F fcj0pt in (1581) into rjf k in (|27T) . and rjf k can be solved to be 

Vf k = " a k Tr[Vl k>N (a k P k * k + all Nk Y l / 2 ^ k {a k P k ^ k + <I^)- 1/a V^Ay}. 

(59) 

As in (l59l ?7/ fc is a function of Ajr fe , it is denoted as rjf k = £, k (Ajr k ). 
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Fig. 1. Multi-hop amplify-and-forward MIMO relaying systems. 
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Fig. 2. Sum-rates of a two-hop AF relaying system when a = 0.6 and /3 — 0. 
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Fig. 3. Weighted MSE of the detected data in a three-hop AF relaying system with different degree of channel estimation 
errors, when a = 0.6 and Pk/cr„ k = 30dB. 
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Fig. 4. Sum-rates of a three-hop AF relaying system when a — 0.6 and f3 — 0. 
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-The algorithm based on estimated CSI only 
-The proposed robust design 
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Fig. 5. Maximum MSE among different received data streams in a three-hop relaying system with a — and j3 = 0.6. 




■Capacity maximization with estimated CSI only 
■Capacity maximization robust design 
■Sum-MSE minimization robust design 
■ MAX-MSE minimization robust design 
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Fig. 6. BERs of the proposed robust design with different design objectives, when a — 0.6, /3 = and al = 0.004. 
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